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SEQUENCE LISTING 

SEQIDNO:! 

Humaa IC-KFX cDNA sequence 

1 TTTCTGCX3CT GAGCCAGGGC ACCCCGGAGC CTGCGGCCTC CTTCCCCGCC 

51 CCTGCGGCCC CGGGTCCCAG CCCCGCCCCG CCCCGCCCCG GGCTGGGGCT 

101 CCGCTGGGGA ACCGGCCGAG CGGCGCGCGC GGAGGTGTCC GGCGGCCAGG 

151 AGGATGGCCA AGGTCCCGGA GCTGGAAGAC ACCTTCCTGC AGGCGCAGCC 

201 TGCGCCCCAA CTGTCCCCGG GGATCCAGGA AGACTGCTGT GTGCAGCTCC 

15 251 TGGGCAAGGG CTTGCTAGTC TATCCGGAAG AAACAGTGTA CCTGGCGGCC 

301 GAAGGGCAGC CCGGGGGCGA GCAGGGCGGC GGGGAGAAAG GCGAAGACCC 

351 GGAGCTGCCG GGGGCAGTGA AATCAGAAAT GCACTTAAAC AATGGTAACT 

20 

401 TTTCCTCTGA AGAAGAGGAC GCCGACAACC ACGACAGCAA AACCAAAGCA 

451 GCGGATCAAT ACCTGTCTCA GA?VGAAAACC ATCACGCAGA TTGTGAAGGA 

25 501 TAAAAAGAAG CAGACACAGC TCACGCTGCA GTGGCTTGAA GAGAATTACA 

551 TTGTATGTGA AGGAGTTTGC TTACCACGGT GCATTCTTTA TGCACACTAC 

601 TTAGATTTCT GTAGGAAAGA GAAATTAGAG CCAGCCTGTG CGGCCACCTT 

30 

651 TGGAAAGACA ATTCGCCAGA AGTTTCCCCT CCTAACAACA AGGCGGCTTG 

701 GAACAAGAGG CCATTCAAAG TATCATTACT ATGGGATTGG CATCAAAGAG 

35 751 AGCAGTGCAT ATTACCACTC CGTTTATTCT GGAAAGGGCT TGACAAGGTT 

801 TTCTGGAAGC AAGCTAAAGA ATGAGGGTGG CTTCACTCGT AAATATTCGC 

851 TTAGCTCAAA AACTGGAACA CTTCTTCCAG AATTCCCCAG CGCTCAACAC 

40 

901 CTTGTATACC AAGGATGCAT TTCTAAGGAC AAGGTTGATA CGCTCATAAT 

951 GATGTACAAA ACTCACTGCC AGTGTATCCT GGACAATGCA ATTAATGGAA 

45 1001 ACTTTGAAGA GATCCAGCAT TTTTTATTAC ACTTTTGGCA AGGAATGCCT 

1051 GACCATCTCC TTCCCCTGCT CGAAAATCCT GTTATCATTG ATATTTTCTG 

1101 TGTTTGTGAC TCAATTCTTT ATAAGGTTCT TACAGATGTA CTCATTCCTG 

1151 CAACAATGCA AGAAATGCCT GAAAGCTTAT TAGCAGACAT AAGAAATTTT 

1201 GCTAAAAATT GGGAACAGTG GGTTGTTTCA TCCTTGGAAA ACTTGCCAGA 
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1251 AGCTCTAACT GACAAGAAAA TACCTATTGT GCGAAGATTT GTATCTTCTC 

1301 TGAAACGACA AACATCTTTC TTACATCTTG CCCAGATTGC CAGACCAGCT 

1351 CTCTTTGACC AGCATGTCGT TAATTCTATG GTGTCTGATA TTGAAAGGGT 

1401 TGATTTGAAC AGCATTGGCT CTCAAGCCCT TCTTACCATT TCAGGCAGCA 

1451 CAGACACTGA ATCTGGTATC TACACTGAAC ATGACTCTAT CACTGTGTTC 

1501 CAAGAACTGA AGGATCTCCT TAAGAAGAAT GCCACTGTGG AGGCTTTTAT 

1551 TGAATGGTTG GATACTGTGG TAGAACAGAG AGTTATTAAG ACCAGCAAAC 

15 1601 AAAATGGAAG GTCATTAi\AG AAGAGAGCTC AAGACTTTCT GTTAAAGTGG 

1651 AGTTTTTTTG GTGCTCGAGT AATGCATAAT CTCACCTTGA ACAATGCATC 

1701 CAGTTTTGGT TCTTTTCATT TGATTCGAAT GCTTCTCGAT GAATACATTC 

20 

1751 TCCTGGCCAT GGAGACCCAG TTTAATAATG ACAAAGAGCA GGAGTTACAG 

1801 AATTTATTGG ACAAGTATAT GAAGAATTCA GATGCGAGTA AAGCTGCTTT 

25 1851 CACTGCTTCT CCGAGTTCAT GCTTTCTGGC CAACCGTAAT AAAGGGAGCA 

1901 TGGTTTCCAG CGACGCTGTG AAGAATGAAA GCCACGTGGA GACAACCTAT 

1951 CTCCCTCTGC CATCCAGTCA ACCTGGAGGC CTAGGCCCTG CTCTGCACCA 

30 

2001 GTTCCCTGCT GGGAACACAG ACAACATGCC GCTCACAGGT CAAATGGAGC 

2051 TTTCACAGAT TGCTGGTCAT CTGATGACAC CACCCATTTC TCCAGCCATG 

35 2101 GCAAGCCGAG GAAGTGTCAT TAACCAAGGA CCAATGGCAG GGAGGCCCCC 

2151 AAGTGTGGGC CCAGTACTGT CAGCTCCATC ACACTGCTCC ACATACCCAG 

2201 AGCCCATTTA TCCCACTCTC CCTCAAGCCA ATCATGACTT TTATAGCACC 

40 

2251 AGCTCTAACT ACCAGACTGT GTTTAGGGCA CAGCCCCACT CCACATCAGG 

2301 ACTCTATCCT CATCACACCG AGCATGGTCG ATGCATGGCT TGGACTGAAC 

45 2351 AGCAGCTTTC AAGA6ACTTC TTCAGTGGCA GCTGTGCGGG GTCTCCATAT 

2401 AACTCCCGGC CACCGTCTAG CTATGGCCCA TCCCTGCAAG CCCAGGATTC 

2451 ACACAATATG CAGTTTTTAA ATACAGGAAG CTTCAATTTC TTGAGCAACA 

50 

2501 CAGGAGCTGC CAGCTGCCAA GGAGCAACAC TGCCTCCTAA TTCACCAAAT 

2551 GGATACTATG GAAGCAACAT AAACTACCCA GAGTCTCACA GGCTCGGATC 

55 2601 AATGGTGAAT CAGCACGTTT CTGTCATCAG CAGCATTCGT TCACTGCCCC 
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2651 CCTACAGTGA CATCCACGAT CCACTTAACA TTTTAGATGA CAGTGGTAGA 

2701 AAACAGACCA GCTCGTTTTA CACAGACACA TCATCTCCAG TTGCATGTCG 

2751 AACTCCAGTC CTAGCTTCCA GTTTGCAAAC CCCAATTCCT TCTTCCTCAT 

2801 CCCAATGTAT GTATGGAACT TCCAACCAGT ATCCAGCTCA AGAAACCCTG 

2851 GACTCCCATG GAACAAGCAG TAGAGAAATG GTGTCCTCTT TACCACCTAT 

2901 CAACACTGTG TTCATGGGAA CAGCAGCTGG AGGCACTTAA ACCACCAATG 

2951 TGGGAGGGGG TGCTAAAACT TTAAA?\AAAA TCTCTACTGT GCAAATATCA 

3001 TTATTCACTC AGACTTCCAT AAGAGTAAAT AAAAAATGAA TATGCAGTSEQ ID 

NO:2 

Human IC-RFX polypeptide sequence. 



1 


MAKVPELEDT 


FLQAQPAPQL 


SPQIQEDCCV 


QLLGKGLLVY 


PEETVYIiAAE 


51 


GQPGGEQGGQ 


EKQEDPELPG 


AVKSEMHLNN 


GNFSSEEEDA 


DNHDSKTKAA 


101 


DQYLSQKKTI 


TQIVKDKKKQ 


TQLTXiQWLEE 


NYIVCEGVCL 


PRCILYAHYL 


151 


DFCRKEKLEP 


ACAATPGKTT 




RIiGTRGHSKY 


HYYGTGTKES 


201 


SAYYHSVYSG 


KGLTRFSGSK 


LKNEGGFTRK 


YSLSSKTGTL 


LPEFPSAQHIj 


251 


VYQGCISKDK 


VDTLIMMYKT 


HCQCILDNAI 


NGNFEEIQHF 


LIjHFWQGMPD 


301 


HLLPLLENPV 


IIDIFCVCDS 


ILYKVIiTDVL 


XPATMQEMPE 


SLLADIRNFA 


351 


KNWEQWWSS 


LENLPEAIiTD 


KKIPIVRRFV 


SSLKRQTSFIi 


HLiAQIARPAL 


401 


FDQHWNSMV 


SDIERVDLNS 


IGSQALLTXS 


GSTDTESGIY 


TEHDSITVFQ 


451 


ELKDLLKKNA 


TVEAFIEWIiD 


TWEQRVIKT 


SKQNGRSLKK 


RAQDFIjLKWS 


501 


FFGARVMHNIi 


TliNNASSFGS 


FHLIRMLLDE 


YILIiAMETQF 


NNDKEQELQN 


551 


LLDKYMKNSD 


ASKAAFTASP 


SSCFLANRNK 


GSMVSSDAVK 


NESHVETTYL 


601 


PLPSSQPGGXj 


GPALHQFPAG 


NTDNMPLTGQ 


MEIjSQIAGHL 


MTPPISPAMA. 


651 


SRGSVXNQGP 


MAGRPPSVGP 


VLSAPSHCST 


YPBPIYPTLP 


QANHDFYSTS 


701 


SNYQTVFRAQ 


PHSTSGLYPH 


HTEHGRCMAW 


TEQQLSRDPP 


SGSCAGSPYN 


751 


SRPPSSYGPS 


LQAQDSHNMQ 


FLNTGSFNFL 


SNTGAASCQG 


ATLPPNSPNG 


801 


YYGSNINYPE 


SHRLGSMVNQ 


HVSVISSIRS 


liPPYSDIHDP 


IiNILDDSGRK 


851 


QTSSPYTDTS 


SPVACRTPVL 


ASSLQTPIPS 


SSSQCMYGTS 


NQYPAQETIiD 


901 


SHGTSSREMV 


SSLPPINTVF 


MGTAAGGT 
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SEQ ID NO: 3 

RFX CONSENSUS SEQX7ENCE 

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGXXXGGXXXX^^ 
XLNXGXXXSEXXXXXXHXSXXXXXXXXXXXKXXXXXXXXXKXXXTXXTLQWLEEN^ 
5 CXKXXXXPXXAAXFGKXIRQXFPXLTTRMiGTRGXSKYHYYGIXXKESSXYYXXXYSXK^ 

XXYSXXSKXGTLLPEFPXXQHXXXXXXIXXXKVXTLIMMYXTHCQXILDXXIXXNFXEXQXFL 
XiXXXXXXXDIXXVCDSILYKXXXDVLIPXXXQEXPXSLXXXIRXFAKNXXXWXXXSI^^ 
XXLKRQTSXXHLAQXXRXXLXXXXXVXXMXXDXXRVDIiNSIXXQAriXT^^ 
XXXXXXEXXIEWIiDTXVXQXVXKXXXXXXXSLKKXAQXFLIiXWSFFGXRVXXXLTI^^ 
10 LXXXEXXXXXXXXQEXXNXIiXXXMKXXXXXXXXXXXXXXXXXXXXXXXGXXXSJ^^ 

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPXXSPAXXXXXXXXXXXXXXXXPPXXXXXXXXXXX^^ 
XXXXXXXXXICXXXSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPXX^ 
XXXXXXXXXXXXXXXXXXXLSNTGXXXXXXXXXXXXXXNGXXXXMXXXXXXXXXXXXXXXXX^^^ 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX^ 

15 xxxxxxxxxxxxxxxx 



SEQ ID NO: 4 
DBD consensus 

20 TLQWIiXXNYXXXEGVXLPRXXLYXHYIiXXaCXXKLEPXXAAXFGKXIR^ 



SEQ ID N0:5 
B domain consensus 
25 VXXLXXXYXXHCXXILDXXXNXXFXXXXXXXXXFW 



SEQ ID NO: 6 

C domain consensus 

30 IjYXXXXXXLIPXXXXXXPXXLXXXIRXPAKXXXXWXXXXIi 



SEQ ID NO: 7 
Dimerization domain 
35 FXXXIiXRXTSXXHLAQXARXXLXXXXXXNXMXSDXXRVDXNXXXXQA^ 

XIiXXXXXXEXXXEWLDXVXXQXXXXXXYXXXXXXXKXAXXFLLKWSFXXXXVXX^ 
YXXXXXEXXXXXXXXXXL 



